… potentially important linguistic features, listed in the Appendix, below. The features
are organized in terms of eight communicative functions:

(a) … of passives and nominalizations.
(b) Writing has a more elaborated style, as in the use of subordinate clauses and
prepositional phrases.
(c) Writing has a more explicit level of expression, as shown by type/token ratio or
precise vocabulary.
(d) Writing has a more explicit marking of informational relations, e.g. cleft …
(e) Speech uses a more inexplicit, informal style of expression, as shown by informal
vocabulary items or the general-reference pronoun it.
(f) Speech refers more to interactional features, e.g. by using 1st and 2nd person
pronouns or questions.
(g) Speech is more situated in a physical/temporal context, as evidenced by place and
time adverbs.
(h) Speech and writing differ in their use of verb tenses and aspects, e.g. the … past
and present tenses.

The factor analysis discussed in §3 shows that some of these features function as
hypothesized in previous research, but that other features must be re-analysed in terms
of dimensions, … show no systematic co-occurrence patterns. It is necessary, … a wide
range of potentially important … in order to identify the underlying textual dimensions
… writing in English.¹

1 … which allow significant improvement in the … For example, since there is no direct
way to identify all verbal forms, I used … the untagged version of t… 2, below). Because
of the large n… grammatical construction can take, … 1982) to represent a… …inted. In
addition, this decision influences the … fied through the presence of a verb (e.g. clefts
and that … is of many other counts. A description of the … Quirk et al. 1972 was used as
the standard … included in this analysis, but were not, e.g. the frequency of different
types of n… Other features were not included because they cannot be analysed
automatically … …ses and conjoined clauses (cf. Chafe 1982) and features representing
different types of cohesion and information structure (Grabe 1984). Future studies should
include analysis of both these sets of features.

… study of actual language usage. A corpus of this type can provide: (a) a number of long
samples, which are necessary to avoid skewing in the reliability and validity of
linguistic counts; (b) text samples taken from several different text types, covering the
range of textual variation in the spoken/written domains; and (c) a standardized data
base that can be shared among scholars, so that individual studies can be replicated, and
results across different studies can be directly compared (see Tottie et al. 1983:7). In
addition, computer programs permit an efficient analysis of a large number of linguistic
features across a large number of texts. In the present study, 41 linguistic features
were counted in 545 text samples, totaling over one million words.

Two separate corpora were used for the text samples. The first is the
Lancaster-Oslo-Bergen Corpus of British Written English (known as the LOB Corpus: see
Johansson et al. 1978, Johansson 1982); this is drawn exclusively from printed sources
published in 1961. It comprises 500 text samples of about 2,000 words each, taken from 15
different genres, e.g. Press (Reportage), Mystery Fiction, and Learned Writings. The
total corpus contains approximately a million words of running text.² The second is the
London-Lund Corpus of Spoken English (LL; Johansson 1982, Svartvik & Quirk 1980). This is
a collection of 87 spoken British English texts of about 5,000 words each. The total
corpus contains approximately 500,000 words representing several different speech
situations, e.g. Conversation, Broadcasts, and Public speeches.

Sixteen major text types, representing the full range of situational possibilities
available in the corpora, were selected for analysis. The distribution of text samples in
each text type is given in Table 1, overleaf.⁴

The composition of some of these text types requires elaboration. ‘Press reports’ include
several subclasses: political, sports, society, spot news, financial, and cultural.
‘Popular lore’ contains texts from popular magazines and books (e.g. Punch, Woman's
Mirror, Wine and Food). ‘Official documents’ are primarily governmental, but also include
foundation reports, industry reports, and a section from a university catalog. ‘Academic
prose’ combines several subclasses, e.g. natural sciences, medicine, mathematics, social
and behavioral sciences, humanities, and technology/engineering. Of the five fictional
text types in the LOB corpus, two are included here: general and romantic …

2 Another large collection, the Brown Corpus, contains 500 written texts of American
English, but was not used in the present study, because of the possible confounding
influence of dialect differences (see Biber 1984). The texts in all three of the
large-scale corpora were produced by … class, university-educated adults. This coherence
in the population under study excludes the possibility of a confounding influence from
social differences, but also highlights the need to … differences across social
parameters (see Poole 1979, Kroch & … 1982).

3 The LOB Corpus does not include any examples of written interpersonal communication.
For that purpose, I used computerized texts of ten professional letters, generously
provided by …

4 … the texts from the London-Lund Corpus, which are approximately 5,000 words in length,
were divided in half to be more closely comparable to the texts in the LOB Corpus (each
approximately 2000 words long) and to provide more spoken samples. The frequency counts
in all texts were standardized for a text length of 2000 words.
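
The standardization of frequency counts to a common text length of 2000 words, mentioned in the footnote on the London-Lund texts, can be illustrated with a minimal sketch. The paper does not give the formula, so simple proportional scaling is an assumption here, and the function name `standardize` and the example counts are hypothetical.

```python
def standardize(count, text_length, basis=2000):
    """Scale a raw feature count to the frequency expected in a text of `basis` words.

    Assumption: plain proportional scaling; the study only states that counts
    were standardized for a text length of 2000 words.
    """
    if text_length <= 0:
        raise ValueError("text_length must be positive")
    return count * basis / text_length


# Hypothetical example: 30 occurrences of a feature in a 2,500-word text sample.
print(standardize(30, 2500))
```

Scaling of this kind makes counts from samples of unequal length comparable before they enter any further analysis.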
… underlie English discourse; i.e., the 41 linguistic features identified from earlier
research are assumed to serve fewer than 41 separate communicative functions. A factor
analysis identifies linguistic features that co-occur with a high frequency in texts, and
this co-occurrence is taken to indicate a common communicative function shared by these
features. Thus each grouping of features, or factor, can be interpreted by consideration
of the communicative function most widely shared by the features.

In a factor analysis, a large number of original variables (in this case, the linguistic
features) are reduced to a small set of derived variables (the factors). Each factor
represents some amount of variation in the original data that can be quantitatively
summarized or generalized: a grouping of variables that co-occur with a high frequency in
the data. However, only the first few factors are likely to account for non-trivial
amounts of the shared variance, and thus be worth further consideration. In the present
case, it was determined that five factors account for non-trivial amounts of variance;
these were hence retained for further analysis.⁵

Each factor is a simple summation of all the linguistic features, with different features
having different weights (known as factor ‘loadings’). A restricted set of the linguistic
features has salient weights on a given factor, which identifies these features as good
representatives of the construct or textual dimension underlying the factor. For example,
if the linguistic features in an analysis were past tense, 3rd person pronouns, relative
clauses, and infinitives, a factor analysis might produce the following:

Factor A = −.89 (past) + .61 (3rd pers.) + .10 (inf.) − .19 (relcl.)
Factor B = −.10 (past) + .29 (3rd pers.) + .56 (inf.) + .65 (relcl.)

The number preceding each of the linguistic features is the weight, or factor loading, of
that feature for the factor; it indicates the extent to which the feature represents the
textual dimension underlying the factor. In the present analysis, features with weights
smaller than .35 on a factor are not considered to be salient, and are not included in
the interpretation of the factor.⁶ Thus, in the above example, past tense (weight .89)
and 3rd person pronouns (weight .61)


5 See Gorsuch 1983, Biber 1985 for a fuller discussion of factor analysis and its
application to … Five factors were retained on the basis of a scree plot of the
eigenvalues, which … clear break between the fifth and sixth factors. The factors were
subsequently rotated … ions among the factors, since the … sume orthogonal factors (see
Gorsuch, 150 ff.) The intercorrelations among the factors were generally quite small,
except for Factors 1-2 (correlation of .58) and Factors 4-5 (correlation of .38). Three
linguistic features (style disjuncts, WH-clefts, and split infinitives) did not have
salient weights on any of the factors; this shows that they had no systematic
distribution with respect to the other features included … These three features were
therefore dropped from the present analysis, and their use for ad… al studies is in
question.

6 Several methods exist to determine the magnitude of significant loadings in a factor
analysis, depending on the number of observations in the analysis. Because of the large
number of observations in the present analysis, qui… ‘significant’; but an absolute
cut-off of .35 was used for the salient loadings.
are the only important features for Factor A; and infinitives (weight .56) and
relative clauses (weight .65) are those for Factor B.
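
The salience criterion applied in this example can be sketched in a few lines of Python. This is only an illustration of the .35 cut-off, not the procedure used in the study: the dictionary layout and the names `loadings`, `CUTOFF`, and `salient_features` are assumptions introduced here. Salience is judged on the absolute value of a loading, so the sign of a weight does not affect the selection.

```python
# Loadings of the hypothetical Factors A and B from the example above
# (signs as printed; only the magnitude matters for salience).
loadings = {
    "Factor A": {"past": -0.89, "3rd pers.": 0.61, "inf.": 0.10, "relcl.": -0.19},
    "Factor B": {"past": -0.10, "3rd pers.": 0.29, "inf.": 0.56, "relcl.": 0.65},
}

CUTOFF = 0.35  # absolute cut-off for salient loadings, as stated in the text


def salient_features(factor_loadings, cutoff=CUTOFF):
    """Return the features whose absolute loading reaches the cut-off."""
    return [name for name, weight in factor_loadings.items() if abs(weight) >= cutoff]


for factor, weights in loadings.items():
    print(factor, salient_features(weights))
```

Run on the two hypothetical factors, the selection keeps past tense and 3rd person pronouns for Factor A, and infinitives and relative clauses for Factor B, which matches the reading given in the text.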

Table 2 summarizes the important features for each of the five factors derived 
through the present analysis. The decimal numbers listed after each linguistic 
feature show the actual factor loadings of the feature in question. 


                Factor 1                        Factor 2                        Factor 3

Features with   yes/no questions        .79     nominalizations         .74     past tense            …
positive        that-clauses            .76     prepositions            …       3rd person pronouns   …
weights         final prepositions      .68     specific conjuncts      .61     perfect aspect        .47
greater         pro-verb do             …       …s passives             .60
than .35        contractions            …       by-passives             …
                I/you                   …       …                       .45
                general hedges          .61     …                       .4…
                if-clauses              .56     attitudinal disjuncts   .35
                WH-questions            …       (word length)           …
                pronoun it              …
                other adverbial
                  subordinators         .48
                specific emphatics      .46
                demonstrative …         …
                WH-clauses              …
                general emphatics       .41
                (present tense)         .42
                (infinitives)           .35

Features with   word length            −.71     place adverbs          −.57     present tense        −.62
negative        type/token ratio       −.65     time adverbs           −.55     adjectives            …
weights                                         relative pronoun
greater                                           deletion              …
than .35                                        subordinator that
                                                  deletion              …
                                                (3rd person pronouns)  −.35

TABLE 2. Summary of the factorial structure of 41 linguistic features. (Features in
parentheses are repeated loadings, and are not used in the computation of the factor
scores; see §4.)
Features loading on Factor 4: relative clauses .65, infinitives .56, (WH-clauses .39),
(present tense .38).
Features loading on Factor 5: other adverbs .69, specific hedges .39.
Features dropped from the analysis (no salient weights): style disjuncts, WH-clefts,
split infinitives.


The negative and positive clusters on a given factor represent two groups of
complementary measures.⁷ That is, when the features with positive weights occur together
frequently in a text, the features with negative weights are markedly less frequent in
that text, and vice versa. Taken together, the positive and negative weights represent
opposite poles defining an underlying textual dimension. Consider Factor 3: the positive
weights are past-time features (past

7 Positive vs. negative weights on a factor do NOT relate to the importance of those
features to the factor. On Factor 3, for example, present tense, with a weight of −.62,
is more important than perfect aspect, with a weight of .47.
… clefts also serve to mark the informational relations among different components of a
text. Attitudinal disjuncts and adverbs occurring with split auxiliaries (e.g. He was
obviously working hard) have lower weights on this factor; they apparently serve to mark
the author/speaker's attitudes in texts having a highly abstract content. In addition,
all the features with positive weights are associated with a high degree of formality and
a learned style.

In contrast, the features with negative weights on Factor 2 share the marking of very
concrete content and more informal style, indicated by high reference to the temporal and
physical situation (by means of place and time adverbs) and reduced surface form, through
deletion of relative pronouns and subordinator that. Place and time adverbs refer
directly to an external situation, clearly marking a more concrete, situated content.
Deletion of relative pronouns and subordinator that mark a reduced correspondence between
the surface form and underlying meaning; they are associated with less formal styles, and
with speech more than writing (Finegan & Biber 1983). They thus reflect a greater
reliance on an external situation than more deliberate styles. Consideration of the
features with positive and negative weights suggests the label ‘Abstract vs. Situated
Content’ for the dimension identified by this factor, i.e. a detached, formal style vs. a
concrete, colloquial one.⁸


3.4. INTERPRETATION OF FACTORS 3-5. For Factor 3, the features with positive weights
(past tense, perfect aspect, and 3rd person pronouns) can all refer to a removed,
narrative context; those with negative weights (present tense and adjectives) can be used
for more immediate reference. The co-occurrence of adjectives with the present tense
apparently indicates the presence of more elaborated content in present-time descriptive
or expository texts than in past-time narrative texts; however, this feature needs
further study. Overall, the dimension identified by this factor distinguishes texts with
a primary narrative emphasis, marked by considerable reference to a removed situation,
from those with non-narrative emphases (descriptive, expository, or other), marked by
little reference to a removed situation but by high occurrence of present tense forms.
These characteristics suggest the label ‘Reported vs. Immediate Style’.

Factors 4-5 are more difficult to interpret than the first three. Factor 4 has only four
features with salient weights, and two of these (WH-clauses and present tense) have
larger weights on other factors. Factor 5 has only two features with salient weights.
Thus neither factor is well represented, and each must be interpreted cautiously.
On-going research is considering other measures in relation to these two factors, to test
their importance and the validity of the interpretations suggested here.

The communicative function shared by the features on Factor 4 (relative 


8 The present study shows th… amount of personal involvemen… 1-2 show that those
associated with personal involvement a… tion (or detachment) belong to separate textual
dimensions, although the t… elated. Professional letters, discussed in §4, illustrate
highly abstract texts with a high level of personal involvement.


SPOKEN AND WRITTEN TEXTUAL DIMENSIONS 
clauses, infinitives, WH-clauses, and present tense) seems to mark an ‘integrative’ type
of subordination (cf. Chafe), as opposed to that associated with real-time production
constraints seen in connection with Factor 1. That is, this type of subordination may be
used to package a high amount of information into a text; it is characteristic of
‘static’ rather than ‘dynamic’ texts (to use Halliday's terms). If this interpretation is
correct, we would expect that features which have been hypothesized as being integrative
(e.g. participles) should co-occur with the features of Factor 4, whereas features
hypothesized as opposing integration (such as conjoined clauses) should occur in a
complementary pattern.

The communicative function shared by the features on Factor 5 (adverbs and specific
hedges) seems to mark the author's or speaker's stance in a text. Specific emphatics have
a weight of .32 on this factor: too low to be considered salient, but in line with the
stated interpretation. Linguistic features which might mark author's stance also occur as
parts of Factor 1 (general hedges, general emphatics, and specific emphatics) and of
Factor 2 (attitudinal disjuncts and adverbs occurring as split auxiliaries); this
indicates that the notion of stance is complex, and requires further research (see Biber
& Finegan 1985).

The interpretations of the dimensions underlying these factors are open to refinement,
and require further validation. As we learn more about the communicative functions of
specific linguistic features, the emphases of some interpretations may shift. The
interpretations given above for the last two factors must be considered speculative,
since they are not well-represented. For Factors 1-3, however, the groupings of features
are quite stable (see the partial replication of this study reported in Biber 1984); thus
we can have confidence in the claim that important textual dimensions are being
represented here, ones that will be useful for defining relations among spoken/written
text types.


A UNIFIED MODEL 

4.1. OVERVIEW OF FACTOR SCORES. In §3, we discussed interpretations which result from the
factor analysis. In Step 2 (§3.1), derived variables that operationally represent the
textual dimension underlying each factor can be computed. These derived variables, known
as FACTOR SCORES, allow further interpretation of the textual dimensions by examination
of the similarities and differences among the text types with respect to each dimension.

A factor score is computed by summing, for a given text, the number of occurrences of the
features having salient weights on that factor. Thus the score for Factor 3 would be
computed by adding the number of past tense forms, perfect aspect forms, and 3rd person
pronouns (i.e. the features with positive weights), and then subtracting the number of
present tense forms and adjectives (i.e. the features with negative weights). For
example, one of the fictional texts in this study has 156 past tense forms, 117 3rd
person pronouns, 24 perfect aspect forms, 46 present tense forms, and 88 adjectives,
resulting in the following factor score for Factor 3:

(156 + 117 + 24) − (46 + 88) = 163
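
The computation just described can be written out as a short Python sketch. The feature lists and counts are taken from the Factor 3 example in the text; the names `POSITIVE`, `NEGATIVE`, and `factor_score` and the dictionary layout are assumptions made for illustration, not part of the original study.

```python
# Features with salient positive and negative weights on Factor 3, as listed in the text.
POSITIVE = ("past tense", "perfect aspect", "3rd person pronouns")
NEGATIVE = ("present tense", "adjectives")


def factor_score(counts, positive=POSITIVE, negative=NEGATIVE):
    """Sum the counts of positive-weight features and subtract the negative-weight ones."""
    return sum(counts[f] for f in positive) - sum(counts[f] for f in negative)


# Counts reported for one of the fictional texts in the study.
fiction_text = {
    "past tense": 156,
    "3rd person pronouns": 117,
    "perfect aspect": 24,
    "present tense": 46,
    "adjectives": 88,
}

print(factor_score(fiction_text))  # 163
```

Note that the score is an unweighted sum: the loadings decide which features enter the score and with which sign, but they are not used as multipliers.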
Some of the linguistic features have salient weights on more than one of the
factors (e.g. word length on Factors 1-2); to assure the experimental independence …
… and press reports show the lowest. Text types with high scores on this dimension are
characterized by frequent occurrences of questions, 1st and 2nd person pronouns,
contractions, pro-verb do, pronoun it, that-clauses, if-clauses …

…ive weights on Factor 1 … …lustrated by the …

[Figure: text types in descending order of mean score: Romantic and General fiction;
Planned speeches, Press reports; Belles-lettres; Spontaneous speeches; Popular lore;
Broadcasts, Face-to-face conversation; …ic prose; … documents; … conversation,
Interviews, Editorial letters; Professional letters]

FIGURE 3. Mean scores for Textual Dimension 3: Reported vs. Immediate Style
(F = 29.48, p < .0001, R*R = 45.5%).

10 Text samples are labeled as follows: CORPUS: TEXT-TYPE. TEXT-NUMBER. For example, Text
Sample 1 is labeled LL:1.4 because it is from the London-Lund Corpus, text type 1
(face-to-face conversation), and is text 4 within that text type. In the spoken text
samples, each line represents an intonation unit.


and so the others mmm
the others sort of feel
that things won't go on much longer

A: well they really haven't any reason to
because I mean finalists are

B: mmm

A: and they actually do finish

B: exactly
of course they do

A: and the others don't


and I'm not in a main line paper
but I'm sure it'll take me all my time to do it
in three weeks
I mean I've seen what it's been like for you
I know ... had more
on the other hand
I must allow myself good time
the first time I do it

A: I don't think I'm going to go on with it

B: are you doing two or one paper this year

…shed in the Calendar apply to all students. Students
… opening of a session and research students are required to …
except with the special permission of the Dean of their Faculty.
… first and last days of term as published are regarded as
…ll be held ...

characteristics of texts having
well, really,


attitudes (the others
I know [that] this is
the opposite characteristics: many long words and a qui…
resulting in an explicit expres…

to ..., regulations for the maintenance ... are
few generalized content terms, self-reference terms,


manner creditable
...), coupled with
TEXT SAMPLE 4 (Professional letters; see fn. 3).
…ly, it should be stressed that … ves, I concede, border on programmatic acti… they were
taken pursuant … the development of specific policy/programmatic recommendations for
consideration by the board of directors in October and the board was informed of them in
our report of May. It should also be understood that … educational exchange threatened
when I hear optimistic fore…sts? Perhaps we can agree th… foreign students will remain
between 2 and 3% of total enrollment. I disagree with your assertion of my claiming that
good extended orientation programs help: I doubt that xxx xxxx has ever viewed his
program as orientation … and indeed that is part of my disenchantment wi…


Sample 3 shows the typical characteristics of fiction; apart from the dialog sections, it
uses only past tense forms, and it has a high frequency of 3rd person … contrast, is
written consistently in the present tense … distributions reflect the dif… from the
clustering of … can be used for description of events in progress … conversation) or for
expository purposes (as in academic … letters). The distribution of text types along
Dimension 3 is thus in agreement with the label ‘Reported vs. Immediate Style’ suggested
in §3.


4.5. THE NEED FOR A MULTI-DIMENSIONAL ACCOUNT OF SPEECH AND WRITING. We have briefly
considered the relations among spoken/written text types with respect to each of the
three textual dimensions identified by the factor analysis. We have seen that the
dimensions are separate: each represents distinct communicative functions, and each
identifies a separate set of similarities and differences among text types. If the
relations among spoken/written text types were considered in terms of only one dimension,
a necessarily … description would result.

Consider the description of professional letters with respect to the three dimensions.
Academic prose, official documents, and professional letters are nearly identical with
respect to Dimension 2. Thus Sample 4 (from a professional letter) shows a high frequency
of nominalizations (initi…, …ment, recommendations, consideration etc.), of passives (be
stressed, were taken, was informed etc.), and of prepositional phrases (to the
development of specific ... for consideration by the board ...). This sample … highly
abstract informational content found in professional letters, like academic prose and
official documents. But with respect to Dimension 1, these text types are quite
different. Professional letters show high type/token ratio and use of long words, as do
academic prose and official documents; but they show considerable personal involvement
and interaction with the reader. Sample 4 shows frequent 1st person reference and high
use of subordinate clauses to express personal feelings (should be stressed that ...,
should also be understood that ..., I doubt that ...), plus frequent interactive features
such as 2nd person pronouns and questions (Is the basis of ...?). Finally, as shown
above, professional letters show the lowest score (or the most ‘immediate’ style) for
Dimension 3. Thus a consideration of any of these dimensions alone would result in an
inaccurate description of professional letters. If we considered only the features
associated with Dimension 2, we might conclude that professional letters were
indistinguishable from academic prose; if we considered only Dimension 1, we might
conclude that professional letters were very similar to planned speeches or broadcasts.
An adequate description of a text type and its relations to other text types requires a
consideration of that text type with respect to all three dimensions.

The need for all three textual dimensions can also be seen from other comparisons.
Planned speeches show a pattern … professional letters, in that both permit considerable
personal involvement and interaction, as … relatively high scores on Dimension 1. Both
also contain quite abstract content and are not highly situated (this is truer of
professional letters than of planned speeches); thus they show high scores on
Dimension 2. Sample 5 illustrates these characteristics in a planned speech:

TEXT SAMPLE 5 (LL:12.5, Planned speeches).
does anyone believe
that we would have accepted for the seventies
a degree of freedom of capital movement
that would have aggravated that power of speculative attack on sterling
which we had to fight i…
if Mr. Barber
with inherited
Labour exchange controls
had to admit to a thousand million loss
through a run on sterling
in six days last June
could any Labour government have agreed ...
can see ten years ahead
… going to tell us
in the next ten weeks


you reject the right of the people to decide
no other resolution

adequately provides

…ish people having the last word
the right of self-determination

Here we see frequent use of the features of interaction and involvement
associated with Dimension 1 (does anyone believe that we ...?; could any ...


зу змәшоэ oi ss рга su srarawesed 2524} OF ра! 
"uegia wy saumpes] asoq Jo ээшэза au] ava jou sr Фата sui jo 1991 ЭЧ ENON 
Ат ол pamen эле siuapmig (3303151) pe 123e3ds 20) Japval рше 101m0 23439 
pom aq ues wapa "20415 peuossadiust Afi грашо! е Sufiuiqeiso 
muou pue saatssed jo ээп aq smu, "uawas 244 JO S59U]DENSGE 2i 01 
ejoa Áqqea odd st (səd 1x33 2410 ЭЦ jo әшоз ш 

таман эр| samea jo 220259 OQ zy 


эзир eos 


-uo3 paaiuarend А 
eui "еу wy ‘Sunum pue 422545 4994129 diysuonejas 241 8014425003 stioisnio 


apeos-os; Jo Аинаецеле 241 
чәе sixa) 556 ZuIpnjour ‘Apms 11125244 ay) Jo adoas əpim эчт “әзошзәцип 
soipnis 1ә1И®ә jo sSurputj рәјгејор эці motn 2Jqeraaduonu! аза элт 00 
pinom Apmis 1125224 24} ur paureiqo 51551 эц, spaumbar зем 51X9) [EOPIAIP 
-ш ur $92022} 215118! pur jo sisÁ[pue pajejap aym *ta183523 Aow 
оа ло} renuosso 219m suonauisa 9534, 'SlJns23 Э4 рә1зшзал—зэшеәгүи 
2315 Jo 5159) SyeUBAIUN эрпш 20 ‘534055 шеәш ‘5212421624 jo 1054102 
panp—pasn Ádi) 219% (are врощәш pensines ayy uonsppe uy `5әЧА} 
1x2) jo pue 'ssjdures 1521 30 'sainseour ansinguy jo Ayoned :soipms Әл 
-uenb sapea Ашеш azuajaereuo Чэна рәуіиэр! aram 51011211534 221] 1$ UT 
=цовәѕәз jsed шоу) pagtawa sey org Зипим pue 432245 19248124 drusuon 
-ejas aq jo anod pasnyuo> әш 40] палі 2q ue» uoneuejdxa 21815 ON "S. 


SRIV 12 915714502 JO NOLLVETIONOJ3 


. 'Suoisuaunp 
qgnixa) гәл [Te JO uorg1ado оц jo uoneIapisuo> snoouujnibis sanba 52941 
usxodsusnium #шаше suonejar əy) Jo үәрош 1[2-22л0 шү *suotsuounp 12410 
оу 122dsa: it^ jouristp anb oq 01 UMOYS are uorsuatutp 200 Зиоје am 
je sadÁíi 1x9) ‘52582 əsayı цв UT 'эреш 2q pno? suosuedwos tons 12410 
“пођишоуа jo uore1ussaad А101506%2 10 55223014 ш Ajyenjoe 5242 
Jo wonduosop зация зало ъїйәлә рәнойәз 20} aouarsjard Juans ви Aq sədA 
vaniuauaxods 22410 п шолу jsp әд 0) UMOYS 51 uonay повшәш1@] Suore 
“(ano8np эг о dn 8и “і apisaq ‘play з} шолу vt Sunuoo 'лойпр 24 
ou :g ajdureg ш BLOpens qejoduxajeoisÁqd в 01 $230213]91 Аитш atr 
эи) ameu рәепиз AYI sy Sv Пом S8 „1 (Є ajdureg vi заодетцешшоп pus 
samssed Jo o3uosqe au] 2100) uon: ш 1097002 19011997 10 428] 341 SMOYS STUL 
"sod uayods oi Jo K1trofew эці que Surdnosd ‘521095 №0] Adan 5123 опо 'Z 
uoisuaung Buoy "Ápnis 3495281 og Jo sadures xa) 94) ш Soperp [500120 jo 
повар oq o1 ‘wed uy paras oq ew uounswayur цац ou "(02861 u2uue у. 
Jo) iu2ursAqoAur-[auos12d ajqeiapjsuo2 yum “әлә! цат ing "(sitiens 




„uos vopanposd әшиҗвәл ou Burry) Зшцеош jo worssaidxo $ 
Ид sn) st 1 (52211 19401550304 #шрпзкә) 62041 Auousodxa a 
тәцї v seu inq ‘ойд! 1X2] UOT әцу Jo SOL yim puo 124401 әт pemo 
sanaoo uon *| uosuawg Suojy "posapisuoa азон, шозиәшр ашо дао И 
ajenbopeu] әд pinom чопту Jo 539$ 2126 4 oui јо uondussop e “Аеш. 
"үшәшоз parens 414810 ‘21922000 € Suey ш $2441 1x2) uayods Kueur 
ye jnq—1u2tu234[0A! jeuos12d 20 иопэтәш api йип uad u; 5241 1X9} 
i Auau ayy әле siseapeosg Tey moys or ролл st g pue ] 500809 
ча 4109 jo uonerapisuos snup “(yong бом ‘joss әці 0) 241022. әу Ш 'ams 
рию ays uo 4240 nf314) uonenits. 1езкзАчалезо@ ша 241 01 5320219721 ъполәшпи 
mos п pue ївәзтица пемориѕойалӣ 10 ‘sanissed 'виопсецешшоп maj $44048 
9 3jdureg ‘рошпиз put 3123405 Ayysiy Sutoq ш 59061 1021 ayads jo &iuofeur 
эці о uis annb aq 01 sistoptoiq smoys g uorsuawig Jo wongiapisuoo в 
"ioxomoH "(SpiOA jo uonnadas цац pue /av]nqeaga jo auta palotrisas 241 а 
имоцх) $по!лдо aimb 210 511215002 wononpoud awn- ч8поцие ‘изшамюл 
[2405524 Jo вчопеәїрш 10 sanies) {20005742 ay SMOYS ajdwes siu T, 
зәли э водит 
эш ope pur 
ry poy оз 8408 inf s} подіхо йпю em 1400) М pus 
ед Кем sm Suniy zasrey and 
эри лу эң чо под 5,4 PIE 
^ mou зәрез{ эф АЕ SUL S.I put 
шшш эң ор эшо saspe 2u рит apis т aq шо Jano aunssaud Jopum FORD 
pos Uno) абое st apis pueys эцу uo pue 
асо зэцэ ap ш ЕН ang $1 anua 341 ur 
pueqasou шрүзбэәцъ эң up apis леу эй по 2240 под) T. PE 
тазитш эшоц шоу Фоорту ола ayi 01 
1150 звать ы 
apis putas эца uo 3240 би pue 
pis purets au) uo тәк aud 
подло mof oj dn Зщшоз s оцт aos 314 6,11 pow 
дктэртога "Ep'OLTT) 9 34RVG DEL 
<р uoisuaung o1 122ds21 qs 5594} 
1x21 оным 1вош о; епш дип aq 01 aeadde 'g 210шес̧ ur se *51582реоЈн 
"sadÁ1 хәт 25941 usimsunsip 01 ранпьза sp suotsusu 
4p aos пе jo пополориеиоз "зочл, "(npo оў POY ‘yf 01 poy әм ‘paid2290 
әлү ртом 782) sunod Kiousodxə әці jo Loddns ani siU2A9 1584 50421 
эзиз ised aja '(sapiaoad 77 12294 noi panafaa 8р7" Л "6 ajdurg шол 
“fray wawas 42011500хә sKaAu02 25091 14255214 “sunoj ised pue quasaid yyoq 
asn soypaads рәцит@ ayas *з1иәлә рәинойә jo worsnjax2 20} pure шо] 3509} 
1125244 10у әзиәләуәзй Зи0л$ € 2404$ 5191191 qguoissajoad ^£ шоиәш(1 Suore 
“золәтоц ‘1521632 $1 sad 1 1X91 ОА SY] 13544199 aauaiSpip au f, (PAYD sata 
ua) 'aunp 1х0} s&np xis) 5 ojdures ш uoneniis fooisdéydyusoduiay 241 0) 62209 
зари anp э41 Japisuod *ajdurexo 10,4 "s19119| 24} Vey) parenus Ад эош 
эч 01 s2u222ds ац 8ш\м\оцв ‘51251 reuo;ssajosd pue 52422345 pauwejd шому 
-9q usingunsip от suioq z vosuawq *(uopmuzuuatap fleas *иоцпотә1 “разаГ 
221 єз "pior aq) 1091000 TaENSAe ue реш Ҷощ ‘Z шовшәша] Чиа 6210120558 
suonvzijpururou рив 59415594 эці Jo sJ? 1ng— (ро! 29 ам 1,u02 бум 14777 дару 




408 LANGUAGE, VOLUME 62, NUMBER 2 (1986) 


clusions have frequently resulted from comparison of different text types with 
linguistic measures taken from different textual dimensions. 

The resolution of contradictory global conclusions can be illustrated through 
a comparison of Chafe 1982 and Blankenship 1962. These admittedly prelim- 
inary studies (see Chafe & Danielewicz 1985, Blankenship 1974) are used here 
because their findings are illustrative of the problem. Chafe found large lin- 
guistic differences between speech and writing by comparing conversation and 
academic papers along the two hypothesized dimensions of integration/fragmentation
and involvement/detachment. By contrast, Blankenship 1962 concluded,
from a comparison of popular journal articles and public speeches, that
there was no over-all difference between speech and writing with respect to 
the measures of sentence length, past tense forms, and passives. 

Chafe's text types strongly biased his study in favor of finding a spoken/written
distinction, since conversation and academic prose show polar distinctions
along Dimensions 1 and 2 identified in the present study: Interactive
vs. Edited Text, and Abstract vs. Situated Content. If conversation is taken
to represent speech, and academic prose to represent writing, then most lin- 
guistic features considered in previous research could be presented as evidence 
for a spoken/written distinction. 

Blankenship compared text types which were more similar (popular journals 
and prepared speeches)—but with respect to linguistic features which, as I 
have shown here, are taken from separate textual dimensions: past tense clus- 
ters with perfect aspect and 3rd person pronouns on Dimension 3 (Reported 
vs. Immediate Style), while passives cluster with nominalizations and prepositional
phrases on Dimension 2. Blankenship found nearly the same number
of passives in writing and speech, and more past-tense forms in speech. These 
results are confirmed by the findings in §4, above: popular lore and planned
speeches have nearly the same value on Dimension 2, and planned speeches
lie closer to the Reported pole than popular lore on Dimension 3. However, we cannot
accept Blankenship's global conclusion that the reverse results for these two 
features demonstrate the unimportance of the spoken/written distinction. That 
is, the relations among text types differ along each of the textual dimensions
described above in §4; and the only major difference between the two text
types used by Blankenship is along Dimension 1—which is not represented in 
her selection of linguistic features.

More specific contradictions can also be resolved with the over-all model 
developed here. Thus Chafe found more passives in writing (academic papers) 
than in speech (conversation); Poole & Field, however, found few passives in 
either writing (narratives) or speech (interviews). The distribution of these text
types along Dimension 2, which includes passive constructions, confirms both 
pairs of findings: academic prose has one of the highest scores, and conversation
one of the lowest; fiction (the text type in the present study most similar
to ‘narrative’) has an intermediate score, as do interviews.

Contradictory findings concerning the extent of subordination in speech and
writing are more difficult to resolve because the function of subordination is
more varied and complex than these other linguistic features. In terms of the 


over-all amount of subordination in a text, Kroll, O'Donnell, and Chafe found
more in writing; but Halliday and Poole & Field found more in speech. Beaman,
who generally found more subordination in speech, perceptively noted that
different types of subordination are present in the two modes. The differences
among these findings are influenced by several parameters. For example, the 
text types chosen for comparison vary widely between studies: while O'Donnell 
looked at only one television interview and one newspaper opinion column,
Chafe compared conversation and academic papers, and Beaman compared
spoken/written narratives. 

Equally important is the fact that the subordination measures used in earlier 
studies are different and not directly comparable. As I have shown, that-clauses,
if-clauses, WH-clauses, and other subordinators (i.e. adverbial clauses)
function as part of a single dimension; but relative clauses have a separate 
communicative function. Infinitives have been grouped as parts of two different 
factors, and thus may have yet another communicative function. Assessments 
of over-all subordination which indiscriminately lump these measures together 
can be expected to produce contradictory results. 

When individual subordination measures are considered separately, the findings
are less contradictory. In agreement with my analysis, most previous
studies have found more that-clauses in speech (Beaman 1984, Frawley 1982,
Jorgensen 1978, and even O'Donnell 1974). However, nearly all studies have 
shown relative clauses to occur more frequently in writing (Frawley, Kroch & 
Hindle, O'Donnell, Chafe); they are distributed differently from that-clauses,
as shown by their clustering on separate dimensions in the present analysis. 
Finally, although O'Donnell and Beaman found more adverbials in writing (an 
apparent contradiction to the clustering of ‘other adverbial subordinators’ on
Factor V in my analysis), Beaman's specific findings for speech agree well with
mine: more condition adverbials (comparable to if-clauses here) and reason
adverbials (e.g. though, although, because, and since—which are several of
the primary tokens in my category ‘other subordinators’). 

We have seen that subordination features function as part of different textual
dimensions, and that they serve differing functions in different text types; contrast
that-clauses on Dimension 1 with relative clauses on Dimension 4. More
detailed study of subordination features as they function in different text types
is required before final conclusions can be drawn concerning their over-all 
distribution and functions. 

This section has shown that the global conclusions reached in previous research
are contradictory because the text types chosen for comparison were
too similar or too different; because the linguistic features chosen belonged to 
different textual dimensions; and because researchers relied on inadequate ana- 
lytical techniques. The analysis presented in §4, above, more adequately rep- 
resents the complex relations among spoken/written text types in English, and 


distributions. Pellegrino & Scopesi 1978 found the 
amount of subordination in spoken and written Italian, as did Jörgensen 1978 for 